Markov decision process

Results: 537



#Item
21Field theory / Mathematics / Algebraic geometry / Valuation / Markov decision process / Probability theory / Logic / Natural deduction

On Structural Properties of MDPs that Bound Loss due to Shallow Planning 1 Nan Jiang1 and Satinder Singh1 and Ambuj Tewari2 Computer Science and Engineering, University of Michigan 2

Add to Reading List

Source URL: dept.stat.lsa.umich.edu

Language: English - Date: 2016-04-20 13:16:34
22Statistics / Graphical models / Probability / Estimation theory / Robot control / Dynamic Bayesian network / Probability theory / Bayesian network / Causality / Particle filter / Partially observable Markov decision process / Belief propagation

Journal of Artificial Intelligence Research–1178 Submitted 12/15; publishedExploiting Causality for Selective Belief Filtering in Dynamic Bayesian Networks

Add to Reading List

Source URL: jair.org

Language: English - Date: 2016-04-28 15:06:13
23Statistics / Game theory / Statistical inference / Dynamic programming / Markov processes / Stochastic control / Probability theory / Partially observable Markov decision process / Coalition / Cooperative game theory / Core / Bayesian game

Sequential Decision Making in Repeated Coalition Formation under Uncertainty Georgios Chalkiadakis Craig Boutilier

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2008-02-08 15:14:59
24Statistics / Statistical theory / Estimation theory / Dynamic programming / Markov decision process / Stochastic control / Bias of an estimator / Reinforcement learning / Loss function / Fisher information

Bias in Natural Actor-Critic Algorithms Philip S. Thomas Department of Computer Science, University of Massachusetts, Amherst, MAUSA

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2012-10-01 18:27:53
25Computational complexity theory / Theory of computation / Dynamic programming / Markov decision process / Stochastic control / Analysis of algorithms / Mathematical logic / Reinforcement learning / Time complexity / Algorithm / PP

Verification of Markov Decision Processes using Learning Algorithms? Tom´asˇ Br´azdil1 , Krishnendu Chatterjee2 , Martin Chmel´ık2 , Vojtˇech Forejt3 , Jan Kˇret´ınsk´y2 , Marta Kwiatkowska3 , David Parker4 , a

Add to Reading List

Source URL: www.hieratic.eu

Language: English
26Dynamic programming / Partially observable Markov decision process / Stochastic control / Hierarchical task network / Robotics / Robot

Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork Bradley Hayes and Brian Scassellati Dept. of Computer Science, Yale University Human-robot teaming has the potential

Add to Reading List

Source URL: bradhayes.info

Language: English - Date: 2016-07-11 15:51:46
27Artificial neural networks / Computational neuroscience / Cybernetics / Q-learning / Recurrent neural network / Reinforcement learning / Long short-term memory / DQN / Markov decision process / Sepp Hochreiter / Artificial intelligence / Quest

Language Understanding for Text-based Games using Deep Reinforcement Learning Karthik Narasimhan∗ CSAIL, MIT

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2015-09-14 21:25:04
28Game theory / Reinforcement learning / Nash equilibrium / Q-learning / Strategy / Partially observable Markov decision process / Action selection / Best response / Bellman equation / Zero-sum game / Agent-based model / Solution concept

Coordination in Multiagent Reinforcement Learning: A Bayesian Approach Georgios Chalkiadakis Craig Boutilier

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2009-03-02 16:24:03
29Computational neuroscience / Belief revision / Reinforcement learning / Computational statistics / Q-learning / Temporal difference learning / Artificial neural network / Machine learning / Markov decision process / Mathematical optimization / Algorithm / Gradient descent

Sutton, Richard PIN

Add to Reading List

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2013-10-18 16:05:54
30Operations research / Dynamic programming / Stochastic control / Markov processes / Reinforcement learning / Markov decision process / Valuation / Algorithm / Mathematical optimization

Learning from Demonstrations: Is It Worth Estimating a Reward Function? Bilal Piot1,2 , Matthieu Geist1 , Olivier Pietquin1,2 1 Supélec, IMS-MaLIS Research group, France

Add to Reading List

Source URL: www.ilhaire.eu

Language: English - Date: 2013-10-03 05:33:46
UPDATE